Dataset statistics
| Number of variables | 12 |
|---|---|
| Number of observations | 158 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 14.9 KiB |
| Average record size in memory | 96.8 B |
Variable types
| Categorical | 2 |
|---|---|
| Numeric | 10 |
Country has a high cardinality: 158 distinct values | High cardinality |
Happiness Rank is highly correlated with Happiness Score and 5 other fields | High correlation |
Happiness Score is highly correlated with Happiness Rank and 5 other fields | High correlation |
Economy (GDP per Capita) is highly correlated with Happiness Rank and 3 other fields | High correlation |
Family is highly correlated with Happiness Rank and 3 other fields | High correlation |
Health (Life Expectancy) is highly correlated with Happiness Rank and 3 other fields | High correlation |
Freedom is highly correlated with Happiness Rank and 1 other fields | High correlation |
Dystopia Residual is highly correlated with Happiness Rank and 1 other fields | High correlation |
Happiness Rank is highly correlated with Happiness Score and 5 other fields | High correlation |
Happiness Score is highly correlated with Happiness Rank and 5 other fields | High correlation |
Economy (GDP per Capita) is highly correlated with Happiness Rank and 3 other fields | High correlation |
Family is highly correlated with Happiness Rank and 4 other fields | High correlation |
Health (Life Expectancy) is highly correlated with Happiness Rank and 3 other fields | High correlation |
Freedom is highly correlated with Happiness Rank and 2 other fields | High correlation |
Dystopia Residual is highly correlated with Happiness Rank and 1 other fields | High correlation |
Happiness Rank is highly correlated with Happiness Score and 3 other fields | High correlation |
Happiness Score is highly correlated with Happiness Rank and 3 other fields | High correlation |
Economy (GDP per Capita) is highly correlated with Happiness Rank and 2 other fields | High correlation |
Family is highly correlated with Happiness Rank and 1 other fields | High correlation |
Health (Life Expectancy) is highly correlated with Happiness Rank and 2 other fields | High correlation |
Health (Life Expectancy) is highly correlated with Happiness Score and 4 other fields | High correlation |
Happiness Score is highly correlated with Health (Life Expectancy) and 7 other fields | High correlation |
Freedom is highly correlated with Happiness Score and 4 other fields | High correlation |
Trust (Government Corruption) is highly correlated with Happiness Score and 2 other fields | High correlation |
Generosity is highly correlated with Region | High correlation |
Happiness Rank is highly correlated with Health (Life Expectancy) and 6 other fields | High correlation |
Region is highly correlated with Health (Life Expectancy) and 7 other fields | High correlation |
Family is highly correlated with Health (Life Expectancy) and 4 other fields | High correlation |
Dystopia Residual is highly correlated with Happiness Score and 1 other fields | High correlation |
Economy (GDP per Capita) is highly correlated with Health (Life Expectancy) and 5 other fields | High correlation |
Country is uniformly distributed | Uniform |
Happiness Rank is uniformly distributed | Uniform |
Country has unique values | Unique |
Economy (GDP per Capita) has unique values | Unique |
Family has unique values | Unique |
Freedom has unique values | Unique |
Generosity has unique values | Unique |
Dystopia Residual has unique values | Unique |
Reproduction
| Analysis started | 2021-07-23 17:53:26.448273 |
|---|---|
| Analysis finished | 2021-07-23 17:53:46.121619 |
| Duration | 19.67 seconds |
| Software version | pandas-profiling v3.0.0 |
| Download configuration | config.json |
| Distinct | 158 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 KiB |
| Uzbekistan | 1 |
|---|---|
| Jamaica | 1 |
| Qatar | 1 |
| Bosnia and Herzegovina | 1 |
| Dominican Republic | 1 |
| Other values (153) |
Length
| Max length | 24 |
|---|---|
| Median length | 7 |
| Mean length | 8.189873418 |
| Min length | 4 |
Characters and Unicode
| Total characters | 1294 |
|---|---|
| Distinct characters | 53 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 158 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Switzerland |
|---|---|
| 2nd row | Iceland |
| 3rd row | Denmark |
| 4th row | Norway |
| 5th row | Canada |
Common Values
| Value | Count | Frequency (%) |
| Uzbekistan | 1 | 0.6% |
| Jamaica | 1 | 0.6% |
| Qatar | 1 | 0.6% |
| Bosnia and Herzegovina | 1 | 0.6% |
| Dominican Republic | 1 | 0.6% |
| Mali | 1 | 0.6% |
| Hong Kong | 1 | 0.6% |
| China | 1 | 0.6% |
| Myanmar | 1 | 0.6% |
| Portugal | 1 | 0.6% |
| Other values (148) | 148 |
Length
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| united | 3 | 1.6% |
| republic | 3 | 1.6% |
| south | 2 | 1.1% |
| cyprus | 2 | 1.1% |
| congo | 2 | 1.1% |
| and | 2 | 1.1% |
| brazil | 1 | 0.5% |
| uzbekistan | 1 | 0.5% |
| malaysia | 1 | 0.5% |
| senegal | 1 | 0.5% |
| Other values (168) | 168 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 201 | |
| i | 114 | 8.8% |
| n | 106 | 8.2% |
| e | 83 | 6.4% |
| r | 77 | 6.0% |
| o | 75 | 5.8% |
| l | 48 | 3.7% |
| t | 47 | 3.6% |
| u | 44 | 3.4% |
| s | 40 | 3.1% |
| Other values (43) | 459 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1079 | |
| Uppercase Letter | 183 | 14.1% |
| Space Separator | 28 | 2.2% |
| Open Punctuation | 2 | 0.2% |
| Close Punctuation | 2 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 201 | |
| i | 114 | |
| n | 106 | |
| e | 83 | 7.7% |
| r | 77 | 7.1% |
| o | 75 | 7.0% |
| l | 48 | 4.4% |
| t | 47 | 4.4% |
| u | 44 | 4.1% |
| s | 40 | 3.7% |
| Other values (16) | 244 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 20 | 10.9% |
| C | 17 | 9.3% |
| M | 15 | 8.2% |
| B | 14 | 7.7% |
| A | 13 | 7.1% |
| T | 11 | 6.0% |
| L | 10 | 5.5% |
| I | 9 | 4.9% |
| K | 9 | 4.9% |
| N | 8 | 4.4% |
| Other values (14) | 57 |
Space Separator
| Value | Count | Frequency (%) |
| 28 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1262 | |
| Common | 32 | 2.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 201 | |
| i | 114 | 9.0% |
| n | 106 | 8.4% |
| e | 83 | 6.6% |
| r | 77 | 6.1% |
| o | 75 | 5.9% |
| l | 48 | 3.8% |
| t | 47 | 3.7% |
| u | 44 | 3.5% |
| s | 40 | 3.2% |
| Other values (40) | 427 |
Common
| Value | Count | Frequency (%) |
| 28 | ||
| ( | 2 | 6.2% |
| ) | 2 | 6.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1294 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 201 | |
| i | 114 | 8.8% |
| n | 106 | 8.2% |
| e | 83 | 6.4% |
| r | 77 | 6.0% |
| o | 75 | 5.8% |
| l | 48 | 3.7% |
| t | 47 | 3.6% |
| u | 44 | 3.4% |
| s | 40 | 3.1% |
| Other values (43) | 459 |
| Distinct | 10 |
|---|---|
| Distinct (%) | 6.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 KiB |
| Sub-Saharan Africa | |
|---|---|
| Central and Eastern Europe | |
| Latin America and Caribbean | |
| Western Europe | |
| Middle East and Northern Africa | |
| Other values (5) |
Length
| Max length | 31 |
|---|---|
| Median length | 18 |
| Mean length | 21.35443038 |
| Min length | 12 |
Characters and Unicode
| Total characters | 3374 |
|---|---|
| Distinct characters | 29 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Western Europe |
|---|---|
| 2nd row | Western Europe |
| 3rd row | Western Europe |
| 4th row | Western Europe |
| 5th row | North America |
Common Values
| Value | Count | Frequency (%) |
| Sub-Saharan Africa | 40 | |
| Central and Eastern Europe | 29 | |
| Latin America and Caribbean | 22 | |
| Western Europe | 21 | |
| Middle East and Northern Africa | 20 | |
| Southeastern Asia | 9 | 5.7% |
| Southern Asia | 7 | 4.4% |
| Eastern Asia | 6 | 3.8% |
| Australia and New Zealand | 2 | 1.3% |
| North America | 2 | 1.3% |
Length
Histogram of lengths of the category
Pie chart
| Value | Count | Frequency (%) |
| and | 73 | |
| africa | 60 | |
| europe | 50 | |
| sub-saharan | 40 | 8.3% |
| eastern | 35 | 7.3% |
| central | 29 | 6.0% |
| america | 24 | 5.0% |
| latin | 22 | 4.6% |
| asia | 22 | 4.6% |
| caribbean | 22 | 4.6% |
| Other values (10) | 105 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 466 | |
| r | 341 | 10.1% |
| 324 | 9.6% | |
| n | 280 | 8.3% |
| e | 271 | 8.0% |
| t | 176 | 5.2% |
| i | 172 | 5.1% |
| d | 115 | 3.4% |
| s | 109 | 3.2% |
| u | 108 | 3.2% |
| Other values (19) | 1012 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2561 | |
| Uppercase Letter | 449 | 13.3% |
| Space Separator | 324 | 9.6% |
| Dash Punctuation | 40 | 1.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 466 | |
| r | 341 | |
| n | 280 | |
| e | 271 | |
| t | 176 | 6.9% |
| i | 172 | 6.7% |
| d | 115 | 4.5% |
| s | 109 | 4.3% |
| u | 108 | 4.2% |
| o | 88 | 3.4% |
| Other values (8) | 435 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 108 | |
| E | 105 | |
| S | 96 | |
| C | 51 | |
| N | 24 | 5.3% |
| L | 22 | 4.9% |
| W | 21 | 4.7% |
| M | 20 | 4.5% |
| Z | 2 | 0.4% |
Space Separator
| Value | Count | Frequency (%) |
| 324 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 40 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3010 | |
| Common | 364 | 10.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 466 | |
| r | 341 | 11.3% |
| n | 280 | 9.3% |
| e | 271 | 9.0% |
| t | 176 | 5.8% |
| i | 172 | 5.7% |
| d | 115 | 3.8% |
| s | 109 | 3.6% |
| u | 108 | 3.6% |
| A | 108 | 3.6% |
| Other values (17) | 864 |
Common
| Value | Count | Frequency (%) |
| 324 | ||
| - | 40 | 11.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3374 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 466 | |
| r | 341 | 10.1% |
| 324 | 9.6% | |
| n | 280 | 8.3% |
| e | 271 | 8.0% |
| t | 176 | 5.2% |
| i | 172 | 5.1% |
| d | 115 | 3.4% |
| s | 109 | 3.2% |
| u | 108 | 3.2% |
| Other values (19) | 1012 |
Happiness Rank
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONUNIFORM| Distinct | 157 |
|---|---|
| Distinct (%) | 99.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 79.49367089 |
| Minimum | 1 |
|---|---|
| Maximum | 158 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 8.85 |
| Q1 | 40.25 |
| median | 79.5 |
| Q3 | 118.75 |
| 95-th percentile | 150.15 |
| Maximum | 158 |
| Range | 157 |
| Interquartile range (IQR) | 78.5 |
Descriptive statistics
| Standard deviation | 45.7543631 |
|---|---|
| Coefficient of variation (CV) | 0.5755724021 |
| Kurtosis | -1.199932134 |
| Mean | 79.49367089 |
| Median Absolute Deviation (MAD) | 39.5 |
| Skewness | 0.0004184693238 |
| Sum | 12560 |
| Variance | 2093.461743 |
| Monotonicity | Increasing |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 82 | 2 | 1.3% |
| 158 | 1 | 0.6% |
| 50 | 1 | 0.6% |
| 57 | 1 | 0.6% |
| 56 | 1 | 0.6% |
| 55 | 1 | 0.6% |
| 54 | 1 | 0.6% |
| 53 | 1 | 0.6% |
| 52 | 1 | 0.6% |
| 51 | 1 | 0.6% |
| Other values (147) | 147 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 |
| Value | Count | Frequency (%) |
| 158 | 1 | |
| 157 | 1 | |
| 156 | 1 | |
| 155 | 1 | |
| 154 | 1 | |
| 153 | 1 | |
| 152 | 1 | |
| 151 | 1 | |
| 150 | 1 | |
| 149 | 1 |
Happiness Score
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 157 |
|---|---|
| Distinct (%) | 99.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.375734177 |
| Minimum | 2.839 |
|---|---|
| Maximum | 7.587 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 KiB |
Quantile statistics
| Minimum | 2.839 |
|---|---|
| 5-th percentile | 3.65585 |
| Q1 | 4.526 |
| median | 5.2325 |
| Q3 | 6.24375 |
| 95-th percentile | 7.2977 |
| Maximum | 7.587 |
| Range | 4.748 |
| Interquartile range (IQR) | 1.71775 |
Descriptive statistics
| Standard deviation | 1.145010135 |
|---|---|
| Coefficient of variation (CV) | 0.212996048 |
| Kurtosis | -0.7760749386 |
| Mean | 5.375734177 |
| Median Absolute Deviation (MAD) | 0.7665 |
| Skewness | 0.09776909409 |
| Sum | 849.366 |
| Variance | 1.311048209 |
| Monotonicity | Decreasing |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 5.192 | 2 | 1.3% |
| 4.642 | 1 | 0.6% |
| 5.098 | 1 | 0.6% |
| 5.129 | 1 | 0.6% |
| 5.889 | 1 | 0.6% |
| 6.937 | 1 | 0.6% |
| 4.694 | 1 | 0.6% |
| 3.681 | 1 | 0.6% |
| 4.35 | 1 | 0.6% |
| 6.611 | 1 | 0.6% |
| Other values (147) | 147 |
| Value | Count | Frequency (%) |
| 2.839 | 1 | |
| 2.905 | 1 | |
| 3.006 | 1 | |
| 3.34 | 1 | |
| 3.465 | 1 | |
| 3.575 | 1 | |
| 3.587 | 1 | |
| 3.655 | 1 | |
| 3.656 | 1 | |
| 3.667 | 1 |
| Value | Count | Frequency (%) |
| 7.587 | 1 | |
| 7.561 | 1 | |
| 7.527 | 1 | |
| 7.522 | 1 | |
| 7.427 | 1 | |
| 7.406 | 1 | |
| 7.378 | 1 | |
| 7.364 | 1 | |
| 7.286 | 1 | |
| 7.284 | 1 |
Standard Error
Real number (ℝ≥0)
| Distinct | 153 |
|---|---|
| Distinct (%) | 96.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.04788474684 |
| Minimum | 0.01848 |
|---|---|
| Maximum | 0.13693 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 KiB |
Quantile statistics
| Minimum | 0.01848 |
|---|---|
| 5-th percentile | 0.0310355 |
| Q1 | 0.0372675 |
| median | 0.04394 |
| Q3 | 0.0523 |
| 95-th percentile | 0.07926 |
| Maximum | 0.13693 |
| Range | 0.11845 |
| Interquartile range (IQR) | 0.0150325 |
Descriptive statistics
| Standard deviation | 0.01714617856 |
|---|---|
| Coefficient of variation (CV) | 0.3580718222 |
| Kurtosis | 5.989346403 |
| Mean | 0.04788474684 |
| Median Absolute Deviation (MAD) | 0.00728 |
| Skewness | 1.983439396 |
| Sum | 7.56579 |
| Variance | 0.0002939914391 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0.03751 | 2 | 1.3% |
| 0.04934 | 2 | 1.3% |
| 0.0378 | 2 | 1.3% |
| 0.04394 | 2 | 1.3% |
| 0.05051 | 2 | 1.3% |
| 0.03553 | 1 | 0.6% |
| 0.06107 | 1 | 0.6% |
| 0.03607 | 1 | 0.6% |
| 0.03328 | 1 | 0.6% |
| 0.05069 | 1 | 0.6% |
| Other values (143) | 143 |
| Value | Count | Frequency (%) |
| 0.01848 | 1 | |
| 0.01866 | 1 | |
| 0.02043 | 1 | |
| 0.02424 | 1 | |
| 0.02596 | 1 | |
| 0.02799 | 1 | |
| 0.03077 | 1 | |
| 0.03084 | 1 | |
| 0.03107 | 1 | |
| 0.03135 | 1 |
| Value | Count | Frequency (%) |
| 0.13693 | 1 | |
| 0.11068 | 1 | |
| 0.10895 | 1 | |
| 0.09811 | 1 | |
| 0.09438 | 1 | |
| 0.08742 | 1 | |
| 0.08658 | 1 | |
| 0.08096 | 1 | |
| 0.07896 | 1 | |
| 0.07832 | 1 |
Economy (GDP per Capita)
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONUNIQUE| Distinct | 158 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.8461372152 |
| Minimum | 0 |
|---|---|
| Maximum | 1.69042 |
| Zeros | 1 |
| Zeros (%) | 0.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.186325 |
| Q1 | 0.5458075 |
| median | 0.910245 |
| Q3 | 1.1584475 |
| 95-th percentile | 1.394645 |
| Maximum | 1.69042 |
| Range | 1.69042 |
| Interquartile range (IQR) | 0.61264 |
Descriptive statistics
| Standard deviation | 0.4031207785 |
|---|---|
| Coefficient of variation (CV) | 0.4764248296 |
| Kurtosis | -0.8669864214 |
| Mean | 0.8461372152 |
| Median Absolute Deviation (MAD) | 0.30658 |
| Skewness | -0.3175746523 |
| Sum | 133.68968 |
| Variance | 0.1625063621 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 1.30232 | 1 | 0.6% |
| 1.12555 | 1 | 0.6% |
| 0.93929 | 1 | 0.6% |
| 0.39753 | 1 | 0.6% |
| 0.2852 | 1 | 0.6% |
| 0.68133 | 1 | 0.6% |
| 1.02564 | 1 | 0.6% |
| 0.77042 | 1 | 0.6% |
| 0.18847 | 1 | 0.6% |
| 0.88113 | 1 | 0.6% |
| Other values (148) | 148 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 0.0153 | 1 | |
| 0.01604 | 1 | |
| 0.0694 | 1 | |
| 0.0712 | 1 | |
| 0.0785 | 1 | |
| 0.08308 | 1 | |
| 0.17417 | 1 | |
| 0.18847 | 1 | |
| 0.19073 | 1 |
| Value | Count | Frequency (%) |
| 1.69042 | 1 | |
| 1.56391 | 1 | |
| 1.55422 | 1 | |
| 1.52186 | 1 | |
| 1.459 | 1 | |
| 1.42727 | 1 | |
| 1.39651 | 1 | |
| 1.39541 | 1 | |
| 1.39451 | 1 | |
| 1.38604 | 1 |
| Distinct | 158 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.9910459494 |
| Minimum | 0 |
|---|---|
| Maximum | 1.40223 |
| Zeros | 1 |
| Zeros (%) | 0.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.415606 |
| Q1 | 0.8568225 |
| median | 1.02951 |
| Q3 | 1.214405 |
| 95-th percentile | 1.3184715 |
| Maximum | 1.40223 |
| Range | 1.40223 |
| Interquartile range (IQR) | 0.3575825 |
Descriptive statistics
| Standard deviation | 0.272369086 |
|---|---|
| Coefficient of variation (CV) | 0.2748299271 |
| Kurtosis | 0.9188188118 |
| Mean | 0.9910459494 |
| Median Absolute Deviation (MAD) | 0.17851 |
| Skewness | -1.006893127 |
| Sum | 156.58526 |
| Variance | 0.07418491901 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 1.18354 | 1 | 0.6% |
| 1.12241 | 1 | 0.6% |
| 1.02507 | 1 | 0.6% |
| 1.11862 | 1 | 0.6% |
| 1.26038 | 1 | 0.6% |
| 0.85563 | 1 | 0.6% |
| 1.28548 | 1 | 0.6% |
| 0.30285 | 1 | 0.6% |
| 0.98521 | 1 | 0.6% |
| 0.67954 | 1 | 0.6% |
| Other values (148) | 148 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 0.13995 | 1 | |
| 0.30285 | 1 | |
| 0.35386 | 1 | |
| 0.38174 | 1 | |
| 0.38562 | 1 | |
| 0.41134 | 1 | |
| 0.41411 | 1 | |
| 0.41587 | 1 | |
| 0.43106 | 1 |
| Value | Count | Frequency (%) |
| 1.40223 | 1 | |
| 1.36948 | 1 | |
| 1.36058 | 1 | |
| 1.34951 | 1 | |
| 1.34043 | 1 | |
| 1.33095 | 1 | |
| 1.32261 | 1 | |
| 1.31967 | 1 | |
| 1.31826 | 1 | |
| 1.31379 | 1 |
Health (Life Expectancy)
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 157 |
|---|---|
| Distinct (%) | 99.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.6302593671 |
| Minimum | 0 |
|---|---|
| Maximum | 1.02525 |
| Zeros | 1 |
| Zeros (%) | 0.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.1515875 |
| Q1 | 0.439185 |
| median | 0.696705 |
| Q3 | 0.8110125 |
| 95-th percentile | 0.942084 |
| Maximum | 1.02525 |
| Range | 1.02525 |
| Interquartile range (IQR) | 0.3718275 |
Descriptive statistics
| Standard deviation | 0.2470777663 |
|---|---|
| Coefficient of variation (CV) | 0.3920255362 |
| Kurtosis | -0.3939350955 |
| Mean | 0.6302593671 |
| Median Absolute Deviation (MAD) | 0.159855 |
| Skewness | -0.7053284857 |
| Sum | 99.58098 |
| Variance | 0.0610474226 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0.92356 | 2 | 1.3% |
| 0.41435 | 1 | 0.6% |
| 0.8116 | 1 | 0.6% |
| 0.51466 | 1 | 0.6% |
| 0.80925 | 1 | 0.6% |
| 0.70806 | 1 | 0.6% |
| 0.7095 | 1 | 0.6% |
| 0.69805 | 1 | 0.6% |
| 0.69702 | 1 | 0.6% |
| 0.53886 | 1 | 0.6% |
| Other values (147) | 147 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 0.04776 | 1 | |
| 0.06699 | 1 | |
| 0.07566 | 1 | |
| 0.07612 | 1 | |
| 0.09131 | 1 | |
| 0.09806 | 1 | |
| 0.1501 | 1 | |
| 0.15185 | 1 | |
| 0.16007 | 1 |
| Value | Count | Frequency (%) |
| 1.02525 | 1 | |
| 1.01328 | 1 | |
| 0.99111 | 1 | |
| 0.96538 | 1 | |
| 0.95562 | 1 | |
| 0.95446 | 1 | |
| 0.94784 | 1 | |
| 0.94579 | 1 | |
| 0.94143 | 1 | |
| 0.93156 | 1 |
| Distinct | 158 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4286149367 |
| Minimum | 0 |
|---|---|
| Maximum | 0.66973 |
| Zeros | 1 |
| Zeros (%) | 0.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.170474 |
| Q1 | 0.32833 |
| median | 0.435515 |
| Q3 | 0.5490925 |
| 95-th percentile | 0.641588 |
| Maximum | 0.66973 |
| Range | 0.66973 |
| Interquartile range (IQR) | 0.2207625 |
Descriptive statistics
| Standard deviation | 0.1506927839 |
|---|---|
| Coefficient of variation (CV) | 0.3515808037 |
| Kurtosis | -0.4607783896 |
| Mean | 0.4286149367 |
| Median Absolute Deviation (MAD) | 0.11246 |
| Skewness | -0.4134619729 |
| Sum | 67.72116 |
| Variance | 0.02270831513 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0.41466 | 1 | 0.6% |
| 0.36772 | 1 | 0.6% |
| 0.40672 | 1 | 0.6% |
| 0.43477 | 1 | 0.6% |
| 0.31767 | 1 | 0.6% |
| 0.5845 | 1 | 0.6% |
| 0.19847 | 1 | 0.6% |
| 0.46582 | 1 | 0.6% |
| 0.55884 | 1 | 0.6% |
| 0.33457 | 1 | 0.6% |
| Other values (148) | 148 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 0.07699 | 1 | |
| 0.09245 | 1 | |
| 0.10081 | 1 | |
| 0.10384 | 1 | |
| 0.1185 | 1 | |
| 0.12102 | 1 | |
| 0.15684 | 1 | |
| 0.17288 | 1 | |
| 0.1826 | 1 |
| Value | Count | Frequency (%) |
| 0.66973 | 1 | |
| 0.66557 | 1 | |
| 0.66246 | 1 | |
| 0.6598 | 1 | |
| 0.65821 | 1 | |
| 0.65124 | 1 | |
| 0.64938 | 1 | |
| 0.64169 | 1 | |
| 0.64157 | 1 | |
| 0.6404 | 1 |
| Distinct | 157 |
|---|---|
| Distinct (%) | 99.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1434218354 |
| Minimum | 0 |
|---|---|
| Maximum | 0.55191 |
| Zeros | 1 |
| Zeros (%) | 0.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.015823 |
| Q1 | 0.061675 |
| median | 0.10722 |
| Q3 | 0.180255 |
| 95-th percentile | 0.401446 |
| Maximum | 0.55191 |
| Range | 0.55191 |
| Interquartile range (IQR) | 0.11858 |
Descriptive statistics
| Standard deviation | 0.1200340736 |
|---|---|
| Coefficient of variation (CV) | 0.8369302568 |
| Kurtosis | 1.384786522 |
| Mean | 0.1434218354 |
| Median Absolute Deviation (MAD) | 0.052555 |
| Skewness | 1.385462595 |
| Sum | 22.66065 |
| Variance | 0.01440817882 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0.32524 | 2 | 1.3% |
| 0.39928 | 1 | 0.6% |
| 0.19317 | 1 | 0.6% |
| 0.05327 | 1 | 0.6% |
| 0.07122 | 1 | 0.6% |
| 0.12905 | 1 | 0.6% |
| 0.10062 | 1 | 0.6% |
| 0.17922 | 1 | 0.6% |
| 0.00227 | 1 | 0.6% |
| 0.10501 | 1 | 0.6% |
| Other values (147) | 147 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 0.00227 | 1 | |
| 0.00649 | 1 | |
| 0.00872 | 1 | |
| 0.01031 | 1 | |
| 0.01078 | 1 | |
| 0.0114 | 1 | |
| 0.01397 | 1 | |
| 0.01615 | 1 | |
| 0.02299 | 1 |
| Value | Count | Frequency (%) |
| 0.55191 | 1 | |
| 0.52208 | 1 | |
| 0.4921 | 1 | |
| 0.48357 | 1 | |
| 0.43844 | 1 | |
| 0.42922 | 1 | |
| 0.41978 | 1 | |
| 0.41372 | 1 | |
| 0.39928 | 1 | |
| 0.38583 | 1 |
| Distinct | 158 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2372955063 |
| Minimum | 0 |
|---|---|
| Maximum | 0.79588 |
| Zeros | 1 |
| Zeros (%) | 0.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.071195 |
| Q1 | 0.1505525 |
| median | 0.21613 |
| Q3 | 0.3098825 |
| 95-th percentile | 0.4751735 |
| Maximum | 0.79588 |
| Range | 0.79588 |
| Interquartile range (IQR) | 0.15933 |
Descriptive statistics
| Standard deviation | 0.126684934 |
|---|---|
| Coefficient of variation (CV) | 0.5338699244 |
| Kurtosis | 1.746527654 |
| Mean | 0.2372955063 |
| Median Absolute Deviation (MAD) | 0.07702 |
| Skewness | 1.001960576 |
| Sum | 37.49269 |
| Variance | 0.01604907251 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0.05444 | 1 | 0.6% |
| 0.09131 | 1 | 0.6% |
| 0.25328 | 1 | 0.6% |
| 0.26475 | 1 | 0.6% |
| 0.07172 | 1 | 0.6% |
| 0.11251 | 1 | 0.6% |
| 0.12388 | 1 | 0.6% |
| 0.20951 | 1 | 0.6% |
| 0.18557 | 1 | 0.6% |
| 0.28214 | 1 | 0.6% |
| Other values (148) | 148 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 0.00199 | 1 | |
| 0.02641 | 1 | |
| 0.05444 | 1 | |
| 0.05547 | 1 | |
| 0.05841 | 1 | |
| 0.06431 | 1 | |
| 0.06822 | 1 | |
| 0.07172 | 1 | |
| 0.07799 | 1 |
| Value | Count | Frequency (%) |
| 0.79588 | 1 | |
| 0.5763 | 1 | |
| 0.51912 | 1 | |
| 0.51752 | 1 | |
| 0.51535 | 1 | |
| 0.50318 | 1 | |
| 0.47998 | 1 | |
| 0.4761 | 1 | |
| 0.47501 | 1 | |
| 0.47179 | 1 |
| Distinct | 158 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.098976772 |
| Minimum | 0.32858 |
|---|---|
| Maximum | 3.60214 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 KiB |
Quantile statistics
| Minimum | 0.32858 |
|---|---|
| 5-th percentile | 1.2365865 |
| Q1 | 1.75941 |
| median | 2.095415 |
| Q3 | 2.462415 |
| 95-th percentile | 3.0374555 |
| Maximum | 3.60214 |
| Range | 3.27356 |
| Interquartile range (IQR) | 0.703005 |
Descriptive statistics
| Standard deviation | 0.5535497923 |
|---|---|
| Coefficient of variation (CV) | 0.2637236389 |
| Kurtosis | 0.5341213282 |
| Mean | 2.098976772 |
| Median Absolute Deviation (MAD) | 0.356215 |
| Skewness | -0.2389108094 |
| Sum | 331.63833 |
| Variance | 0.3064173726 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 2.77729 | 1 | 0.6% |
| 2.67585 | 1 | 0.6% |
| 1.87996 | 1 | 0.6% |
| 1.59927 | 1 | 0.6% |
| 3.10712 | 1 | 0.6% |
| 1.59888 | 1 | 0.6% |
| 1.6944 | 1 | 0.6% |
| 2.89319 | 1 | 0.6% |
| 2.1309 | 1 | 0.6% |
| 2.05125 | 1 | 0.6% |
| Other values (148) | 148 |
| Value | Count | Frequency (%) |
| 0.32858 | 1 | |
| 0.65429 | 1 | |
| 0.67042 | 1 | |
| 0.67108 | 1 | |
| 0.89991 | 1 | |
| 0.98195 | 1 | |
| 0.99895 | 1 | |
| 1.21305 | 1 | |
| 1.24074 | 1 | |
| 1.26462 | 1 |
| Value | Count | Frequency (%) |
| 3.60214 | 1 | |
| 3.26001 | 1 | |
| 3.19131 | 1 | |
| 3.17728 | 1 | |
| 3.10712 | 1 | |
| 3.10709 | 1 | |
| 3.08854 | 1 | |
| 3.05137 | 1 | |
| 3.035 | 1 | |
| 2.89319 | 1 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here. A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
First rows
| Country | Region | Happiness Rank | Happiness Score | Standard Error | Economy (GDP per Capita) | Family | Health (Life Expectancy) | Freedom | Trust (Government Corruption) | Generosity | Dystopia Residual | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Switzerland | Western Europe | 1 | 7.587 | 0.03411 | 1.39651 | 1.34951 | 0.94143 | 0.66557 | 0.41978 | 0.29678 | 2.51738 |
| 1 | Iceland | Western Europe | 2 | 7.561 | 0.04884 | 1.30232 | 1.40223 | 0.94784 | 0.62877 | 0.14145 | 0.43630 | 2.70201 |
| 2 | Denmark | Western Europe | 3 | 7.527 | 0.03328 | 1.32548 | 1.36058 | 0.87464 | 0.64938 | 0.48357 | 0.34139 | 2.49204 |
| 3 | Norway | Western Europe | 4 | 7.522 | 0.03880 | 1.45900 | 1.33095 | 0.88521 | 0.66973 | 0.36503 | 0.34699 | 2.46531 |
| 4 | Canada | North America | 5 | 7.427 | 0.03553 | 1.32629 | 1.32261 | 0.90563 | 0.63297 | 0.32957 | 0.45811 | 2.45176 |
| 5 | Finland | Western Europe | 6 | 7.406 | 0.03140 | 1.29025 | 1.31826 | 0.88911 | 0.64169 | 0.41372 | 0.23351 | 2.61955 |
| 6 | Netherlands | Western Europe | 7 | 7.378 | 0.02799 | 1.32944 | 1.28017 | 0.89284 | 0.61576 | 0.31814 | 0.47610 | 2.46570 |
| 7 | Sweden | Western Europe | 8 | 7.364 | 0.03157 | 1.33171 | 1.28907 | 0.91087 | 0.65980 | 0.43844 | 0.36262 | 2.37119 |
| 8 | New Zealand | Australia and New Zealand | 9 | 7.286 | 0.03371 | 1.25018 | 1.31967 | 0.90837 | 0.63938 | 0.42922 | 0.47501 | 2.26425 |
| 9 | Australia | Australia and New Zealand | 10 | 7.284 | 0.04083 | 1.33358 | 1.30923 | 0.93156 | 0.65124 | 0.35637 | 0.43562 | 2.26646 |
Last rows
| Country | Region | Happiness Rank | Happiness Score | Standard Error | Economy (GDP per Capita) | Family | Health (Life Expectancy) | Freedom | Trust (Government Corruption) | Generosity | Dystopia Residual | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 148 | Chad | Sub-Saharan Africa | 149 | 3.667 | 0.03830 | 0.34193 | 0.76062 | 0.15010 | 0.23501 | 0.05269 | 0.18386 | 1.94296 |
| 149 | Guinea | Sub-Saharan Africa | 150 | 3.656 | 0.03590 | 0.17417 | 0.46475 | 0.24009 | 0.37725 | 0.12139 | 0.28657 | 1.99172 |
| 150 | Ivory Coast | Sub-Saharan Africa | 151 | 3.655 | 0.05141 | 0.46534 | 0.77115 | 0.15185 | 0.46866 | 0.17922 | 0.20165 | 1.41723 |
| 151 | Burkina Faso | Sub-Saharan Africa | 152 | 3.587 | 0.04324 | 0.25812 | 0.85188 | 0.27125 | 0.39493 | 0.12832 | 0.21747 | 1.46494 |
| 152 | Afghanistan | Southern Asia | 153 | 3.575 | 0.03084 | 0.31982 | 0.30285 | 0.30335 | 0.23414 | 0.09719 | 0.36510 | 1.95210 |
| 153 | Rwanda | Sub-Saharan Africa | 154 | 3.465 | 0.03464 | 0.22208 | 0.77370 | 0.42864 | 0.59201 | 0.55191 | 0.22628 | 0.67042 |
| 154 | Benin | Sub-Saharan Africa | 155 | 3.340 | 0.03656 | 0.28665 | 0.35386 | 0.31910 | 0.48450 | 0.08010 | 0.18260 | 1.63328 |
| 155 | Syria | Middle East and Northern Africa | 156 | 3.006 | 0.05015 | 0.66320 | 0.47489 | 0.72193 | 0.15684 | 0.18906 | 0.47179 | 0.32858 |
| 156 | Burundi | Sub-Saharan Africa | 157 | 2.905 | 0.08658 | 0.01530 | 0.41587 | 0.22396 | 0.11850 | 0.10062 | 0.19727 | 1.83302 |
| 157 | Togo | Sub-Saharan Africa | 158 | 2.839 | 0.06727 | 0.20868 | 0.13995 | 0.28443 | 0.36453 | 0.10731 | 0.16681 | 1.56726 |